Transgenic Cells Expressing Proteins Involved In Fatty Acid Biosynthesis



The invention relates to transgenic cells comprising nucleic acid molecules that
comprise nucleic acid sequences which encode enzymes involved in the biosynthesis
of n-3 fatty acids.

DHA, an example of an n-3 fatty acid, can be obtained directly from the diet or
derived from metabolism of dietary linoleic and α-linolenic acid. To obtain
sufficient amounts of this fatty acid humans have to eat foods rich in DHA.

Currently the principal dietary source of DHA is fish or fish oil. However, this has
many inherent problems: fish accumulate pollutants, the extracted oil has an
unpleasant odour, it is difficult to control the proportion of specific
desirable fatty acids from this source and, since fish are a declining resource, the
market demand for DHA is not being met. Also, vegetarians do not have an obvious
alternative food source to fish and therefore either do without DHA or have to take
pure supplements.

Long chain polyunsaturated fatty acids (LPUFAs) are derived from the essential fatty
acids (EFA) linoleic acid (18:2n-6) and α-linolenic acid (18:3n-3), the parent
compounds of the so-called omega-6 and omega-3 EFA families respectively, by an
alternating series of desaturation and elongation reactions (Haag, 2001), see
Figure 4. The major metabolite product of the n-6 pathway in mammals is arachidonic
acid (AA) (20:4n-6), whilst the major end products of the n-3 pathway are
eicosapentaenoic acid (EPA) (20:5n-3) and docosahexaenoic acid (DHA, 22:6n-3). The
biosynthesis of 18:4n-3 from 18:3n-3 involves the action of a Δ6 desaturase
(Horrobin DF, 1992). This is followed by an elongation reaction to 20:4n-3
(Sprecher et al., 1995) and a Δ5 desaturation to 20:5n-3 (Sprecher et al., 1995).
The conventional view is that there is then a further elongation step converting
20:5n-3 to 22:5n-3, which is then followed by a final desaturation step involving
the activity of a Δ4 desaturase to produce DHA (22:6n-3).
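By way of illustration only, the conventional n-3 route from α-linolenic acid to DHA can be sketched as a sequence of simple operations on the carbons:double-bonds shorthand. The sketch assumes that each desaturation adds one double bond and each elongation adds two carbons; the names `FattyAcid`, `desaturate`, `elongate` and `run_pathway` are hypothetical and form no part of the disclosure.

```python
from typing import NamedTuple


class FattyAcid(NamedTuple):
    """Shorthand notation, e.g. 18:3n-3 (carbons : double bonds, family)."""
    carbons: int
    double_bonds: int
    family: str  # "n-3" or "n-6"

    def __str__(self) -> str:
        return f"{self.carbons}:{self.double_bonds}{self.family}"


def desaturate(fa: FattyAcid) -> FattyAcid:
    # Assumption: a desaturase introduces exactly one additional double bond.
    return fa._replace(double_bonds=fa.double_bonds + 1)


def elongate(fa: FattyAcid) -> FattyAcid:
    # Assumption: an elongase extends the chain by two carbons.
    return fa._replace(carbons=fa.carbons + 2)


# Conventional order of steps as described in the text.
CONVENTIONAL_STEPS = [
    ("delta-6 desaturase", desaturate),
    ("elongase", elongate),
    ("delta-5 desaturase", desaturate),
    ("elongase", elongate),
    ("delta-4 desaturase", desaturate),
]


def run_pathway(start: FattyAcid, steps=CONVENTIONAL_STEPS) -> list:
    """Return the shorthand of every intermediate, starting substrate included."""
    trail = [str(start)]
    current = start
    for _enzyme, reaction in steps:
        current = reaction(current)
        trail.append(str(current))
    return trail


if __name__ == "__main__":
    # prints: 18:3n-3 -> 18:4n-3 -> 20:4n-3 -> 20:5n-3 -> 22:5n-3 -> 22:6n-3
    print(" -> ".join(run_pathway(FattyAcid(18, 3, "n-3"))))
```

Applying only the first three steps to linoleic acid (18:2n-6) gives 20:4n-6, which agrees with arachidonic acid being the major product of the n-6 pathway.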
During evolution humans have consumed a diet containing an approximately equal ratio
of n-3 and n-6 essential fatty acids (1-2:1), but the last 100-150 years have seen a
growing trend in Western diets towards the consumption of more n-6 fatty acids,
resulting in an alteration of the ratio to 30:1 (Simonpolous, 1999). Whilst an
increased intake of n-6 fatty acids is characterised by cardiovascular problems such
as increased blood viscosity, vasospasm and vasoconstriction, the n-3 fatty acids are
associated with health promoting properties. For example, n-3 fatty acids have been
described as anti-inflammatory, antithrombotic, antiarrhythmic, hypolipidemic and
vasodilatory (Simonpolous, 1999). As such, the role of DHA in the prevention and/or
treatment of diseases such as coronary heart disease, hypertension, type II diabetes,
ocular diseases, arthritis, cystic fibrosis and schizophrenia has been the focus of a
great deal of medical research.

Studies of the effect of n-3 polyunsaturated fatty acids in cardiovascular disease
have shown that dietary intake of DHA can lower the risk of myocardial infarction,
hypertension and complications associated with cardiac surgery. A number of
population studies have correlated the dietary intake of DHA with cardiovascular risk
factors. For instance, a study of a population of Inuits in Canada (426 subjects aged
18-74 yr), who traditionally consume large amounts of marine foods rich in n-3 fatty
acids, showed that n-3 fatty acids, such as DHA, were positively associated with
HDL-cholesterol concentrations and inversely associated with triacylglycerol
concentrations and the ratio of total to HDL cholesterol (Dewailly et al., 2001). It
was concluded that the high dietary intake of n-3 fatty acids in the Inuit diet was
probably responsible for the low mortality rate from ischemic heart disease in this
population.

Essential fatty acids are structural components of all tissues and are indispensable for
cell membrane synthesis. The brain, retina and other neural tissues have been found
to be particularly rich in DHA, where it is involved in neural development and
maturation of sensory systems (Uauy et al., 2000). A large body of research
comparing infants fed with breast milk to those fed with formula milk, which is
deficient in DHA and other omega-3 fatty acids, has concluded that the presence of
DHA is critical during the development of the newborn (Horrocks et al., 1999). DHA
forms 25% of the fatty acid complement of the glycosphingolipids of the brain and is
an important component of the rods of the retina, and therefore a deficiency in DHA
during infant development has been associated with a reduction in cognitive function
and visual acuity. Furthermore, deficiencies in DHA have been associated with foetal
alcohol syndrome, attention deficit hyperactivity disorder, cystic fibrosis,
phenylketonuria and adrenoleukodystrophy.

To meet this increased demand for n-3 fatty acids such as DHA, a number of
approaches have been attempted. Methods to enhance the DHA content of meat by
manipulating animal feed have met with little success. The cultivation of marine
micro-organisms such as Crypthecodinium cohnii and Schizochytrium sp., which
are rich sources of DHA, has also met with only limited success, as the cultivation
of algae is technically demanding and costly (Ashford et al., 2000).

There has been limited research focused on the identification of genes involved in
the biosynthesis of n-3 fatty acids in algae. In one report, the identification of a
cDNA encoding a novel C18-Δ9 polyunsaturated fatty acid-specific elongating activity
from the docosahexaenoic acid (DHA)-producing microalga Isochrysis galbana was
described (Qi et al., 2002). This 30 kDa elongase, designated IgASE1, shares only
limited homology to animal and fungal proteins with elongating activity. When
IgASE1 was expressed in the yeast Saccharomyces cerevisiae, it was shown to
specifically elongate the C18-Δ9 polyunsaturated fatty acids linoleic acid
(C18:2n-6, Δ9,12) and alpha-linolenic acid (C18:3, Δ9,12,15) to eicosadienoic acid
(C20:2, Δ11,14) and eicosatrienoic acid (C20:3, Δ11,14,17), respectively. It was
concluded that a major route for eicosapentaenoic acid (C20:5, Δ5,8,11,14,17) and
docosahexaenoic acid (C22:6, Δ4,7,10,13,16,19) syntheses in I. galbana may involve
a Δ8 desaturation pathway.
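The renumbering of double bonds on elongation reported for this elongase (Δ9,12 becoming Δ11,14) follows from the Δ convention, in which positions are counted from the carboxyl carbon. A minimal sketch, assuming that elongation adds two carbons at the carboxyl end, is given below; the function names are hypothetical and chosen for illustration only.

```python
def elongate(carbons, delta_positions, added=2):
    """Extend the chain at the carboxyl end.

    Delta positions are counted from the carboxyl carbon, so every existing
    double bond moves `added` positions further along the chain.
    """
    return carbons + added, tuple(p + added for p in delta_positions)


def omega_family(carbons, delta_positions):
    """Return x in 'n-x': the last double bond counted from the methyl end."""
    return carbons - max(delta_positions)


if __name__ == "__main__":
    linoleic = (18, (9, 12))
    alpha_linolenic = (18, (9, 12, 15))
    print(elongate(*linoleic))         # C20:2, delta 11,14
    print(elongate(*alpha_linolenic))  # C20:3, delta 11,14,17
```

Because the methyl end is untouched, the n-x family of the product is the same as that of the substrate, which is why elongation of an n-3 substrate always yields an n-3 product.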
Δ6 and Δ5 desaturases are microsomal enzymes that are thought to be a component
of a three-enzyme system that includes NADH-cytochrome b5 reductase, cytochrome
b5 and the respective desaturase (Sprecher, 1981).

A number of Δ6 and Δ5 desaturases have been identified. In plants such as the herb
borage (Borago officinalis), the Δ6 desaturase has been identified (Sayanova et al.,
1997). Δ6 and Δ5 desaturases have been identified in humans (Hyekyung et al., 1999
and Cho et al., 1999, respectively), in animals such as the nematode Caenorhabditis
elegans (Michaelson et al., 1998 and Napier et al., 1998) and in eukaryotic
microorganisms such as the fungus Mortierella alpina (Huang et al., 1999 and
Knutzon et al., 1998). In the human, Δ6 and Δ5 desaturase activities have been found
in skeletal muscle, lung, placenta, kidney and pancreas, but are expressed at the
highest levels in the liver, brain and heart (Hyekyung et al., 1999). In all these
tissues, however, Δ6 desaturase activity was found to be higher than that of Δ5
desaturase. The genes for both of the enzymes reside on chromosome 11, in a reverse
orientation, being separated by <11,000 base pairs (Hyekyung et al., 1999). A Δ4
desaturase that can introduce a double bond at position 4 of 22:5 n-3 and 22:4 n-6,
resulting in the production of DHA and docosapentaenoic acid, has been identified in
the marine fungi Thraustochytrium sp. (Qiu et al., 2001).

Cellular storage of fatty acids in triacylglycerol requires that the fatty acids are first
activated to their acyl-CoA esters through the action of an acyl-CoA synthetase
enzyme. Acyl-CoAs are produced by acyl-CoA synthetase from fatty acid, ATP and
Coenzyme A. Acyl-CoA synthetases can exhibit substrate specificity for different
chain lengths or different degrees of saturation of the fatty acid. For example, an
arachidonate (20:4 n-6)-preferring acyl-CoA synthetase has been identified in rat
(Kang et al., 1997). This enzyme has a high affinity for arachidonate and EPA and
low affinity for palmitate. Several isoforms of acyl-CoA synthetases have also been
identified in Arabidopsis (Schnurr et al., 2000).

Acyl CoA:diacylglycerol acyltransferase (DGAT) catalyses the final enzymatic step
in the production of triacylglycerols in plants, fungi and mammals. The enzyme is
responsible for transferring an acyl group from acyl-CoA to the sn-3 position of
1,2-diacylglycerol (DAG) to form triacylglycerol (TAG). The first cloning of a DGAT
gene was from mouse (Cases et al., 1998). An Arabidopsis homologue of the mouse
DGAT gene was subsequently reported and found to be present as a single copy gene
(Hobbs et al., 1999). Jako et al. (2001) showed that the Arabidopsis Tag1 mutant,
which is disrupted in the DGAT gene and has a fatty acid and reduced oil phenotype,
can be complemented by expression of the DGAT cDNA. Jako et al. (2001) also
showed that seed-specific over-expression of the DGAT cDNA in wild-type
Arabidopsis enhances oil deposition and average seed weight, thus confirming the
important role of DGAT in regulating the quantity of seed triacylglycerols and the
sink size in developing seeds. Protein purification based studies on the oleaginous
fungus Mortierella ramanniana resulted in the identification of a second class of
proteins involved in TAG production, encoded by the DGAT2 gene family, that are
unrelated to the previously identified DGAT1 gene family (Lardizabal et al., 2001).
A human homologue of the Mortierella ramanniana DGAT2 gene has also been
identified (Cases et al., 2001). Substrate specificities of the different families
have yet to be determined.

Whilst higher plants do not typically biosynthesise LPUFAs such as DHA, they are
an attractive target for genetic manipulation, particularly for the low cost
production of DHA in the vegetable oil of a crop such as oilseed rape. Although
there have been no reports of higher plants that biosynthesise DHA, a number of
attempts to introduce algal genes in order to manipulate the biosynthetic capacity
of oil seed plants that produce LPUFAs have been reported. These have included the
introduction of desaturases into transgenic plants to increase the production of
DHA, EPA and also stearidonic acid (18:4n-3).

We herein disclose nucleic acid molecules which encode enzymes involved in n-3
fatty acid metabolism and the manipulation of these sequences and the biochemical
pathways which comprise enzymes encoded by these sequences, to provide an
alternative dietary source of n-3 fatty acids and, in particular, DHA. The sequences
encode n-3 fatty acid elongase, desaturase, acyl CoA synthetase and diacylglycerol
acyltransferase activities.



According to an aspect of the invention there is provided a transgenic cell comprising 
a nucleic acid molecule comprising a nucleic acid sequence selected from the group 
consisting of: 

(i) a DNA molecule consisting of a DNA sequence as represented in
Figures 1a, 1b or 1c;

(ii) a DNA molecule which hybridises to the sequences identified in (i)
above and which encodes a polypeptide which has fatty acid elongase
activity; and

(iii) DNA molecules consisting of DNA sequences that are degenerate as a
result of the genetic code to the DNA sequence defined in (i) and (ii).

In a preferred embodiment of the invention said nucleic acid molecule anneals under 
stringent hybridisation conditions to the sequences described in (i), (ii) and (iii) 
above. 
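The notion of sequences that are "degenerate as a result of the genetic code" can be illustrated with a short sketch: two DNA sequences that differ at the nucleotide level are degenerate equivalents if they translate to the same polypeptide. The sketch below assumes the standard genetic code; the example sequences are hypothetical and are not taken from the Figures.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order ('*' = stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding DNA sequence (length a multiple of 3) to one-letter amino acids."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def are_degenerate_equivalents(dna_a, dna_b):
    """True if both sequences encode the same polypeptide."""
    return translate(dna_a) == translate(dna_b)


if __name__ == "__main__":
    # Hypothetical sequences: both encode Met-Ala-Leu using different codons.
    print(are_degenerate_equivalents("ATGGCTTTA", "ATGGCCCTG"))
```

Hybridisation under stringent conditions, by contrast, depends on nucleotide-level similarity and is not modelled by this sketch.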



In a preferred embodiment of the invention said nucleic acid molecules are isolated 
from an algal species. 



Preferably said algal species is selected from the group consisting of: Amphidinium
carterae, Amphiphora hyalina, Amphiphora sp., Chaetoceros gracilis, Coscinodiscus
sp., Crypthecodinium cohnii, Cryptomonas sp., Cylindrotheca fusiformis, Haslea
ostrearia, Isochrysis galbana, Nannochloropsis oculata, Navicula sp., Nitzschia
closterium, Pavlova lutheri, Phaeodactylum tricornutum, Prorocentrum minimum,
Rhizosolenia setigera, Skeletonema costatum, Skeletonema sp., Tetraselmis
tetrathele, Thalassiosira nitzschioides, Thalassiosira heterophorma, Thalassiosira
pseudonana, Thalassiosira stellaris.
In a further preferred embodiment of the invention said polypeptide is a variant
polypeptide and comprises the amino acid sequence represented in Figure 2a, 2b, or
2c which sequence has been modified by deletion, addition or substitution of at least
one amino acid residue wherein said modification enhances the enzyme activity of
said polypeptide.

A variant polypeptide may differ in amino acid sequence by one or more
substitutions, additions, deletions or truncations that may be present in any
combination. Among preferred variants are those that vary from a reference
polypeptide by conservative amino acid substitutions. Such substitutions are those
that substitute a given amino acid by another amino acid of like characteristics. The
following non-limiting list of amino acids are considered conservative replacements
(similar): a) alanine, serine and threonine; b) glutamic acid and aspartic acid; c)
asparagine and glutamine; d) arginine and lysine; e) isoleucine, leucine, methionine
and valine; and f) phenylalanine, tyrosine and tryptophan. Most highly preferred are
variants that retain or enhance the same biological function and activity as the
reference polypeptide from which they vary.

A functionally equivalent polypeptide according to the invention is a variant
in which one or more amino acid residues are substituted with conserved
or non-conserved amino acid residues, or one in which one or more amino acid
residues includes a substituent group. Conservative substitutions are the
replacements, one for another, among the aliphatic amino acids Ala, Val, Leu and Ile;
interchange of the hydroxyl residues Ser and Thr; exchange of the acidic residues
Asp and Glu; substitution between amide residues Asn and Gln; exchange of the
basic residues Lys and Arg; and replacements among aromatic residues Phe and Tyr.
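The conservative substitution groups listed in the preceding paragraph lend themselves to a simple lookup. The following is a minimal sketch, assuming one-letter amino acid codes and two sequences of equal length that are already aligned without gaps; the function names and the example peptides are hypothetical.

```python
# Groups taken from the preceding paragraph, in one-letter code:
# aliphatic (Ala, Val, Leu, Ile), hydroxyl (Ser, Thr), acidic (Asp, Glu),
# amide (Asn, Gln), basic (Lys, Arg), aromatic (Phe, Tyr).
CONSERVATIVE_GROUPS = (
    frozenset("AVLI"),
    frozenset("ST"),
    frozenset("DE"),
    frozenset("NQ"),
    frozenset("KR"),
    frozenset("FY"),
)


def is_conservative(residue_a, residue_b):
    """True if two different residues belong to the same conservative group."""
    a, b = residue_a.upper(), residue_b.upper()
    return a != b and any(a in group and b in group for group in CONSERVATIVE_GROUPS)


def classify_substitutions(reference, variant):
    """List (position, reference residue, variant residue, conservative?) for each difference.

    Positions are 1-based. Both sequences must have the same length.
    """
    if len(reference) != len(variant):
        raise ValueError("sequences must be the same length (pre-aligned, no gaps)")
    return [
        (pos, ref, var, is_conservative(ref, var))
        for pos, (ref, var) in enumerate(zip(reference, variant), start=1)
        if ref != var
    ]


if __name__ == "__main__":
    # Hypothetical peptides, for illustration only.
    for row in classify_substitutions("MKLSD", "MRLTG"):
        print(row)
```

A different grouping, such as the alternative list a) to f) given earlier in the description, could be substituted by changing `CONSERVATIVE_GROUPS` alone.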

In addition, the invention features polypeptide sequences having at least 75% identity
with the polypeptide sequences as herein disclosed, or fragments and functionally
equivalent polypeptides thereof. In one embodiment, the polypeptides have at least
85% identity, more preferably at least 90% identity, even more preferably at least
95% identity, still more preferably at least 97% identity, and most preferably at least
99% identity with the amino acid sequences illustrated herein.
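The description does not specify how percentage identity is to be calculated. One common convention, shown here purely as an assumption, is to count identical residues over the columns of a pairwise alignment. The sketch below expects two sequences that have already been aligned (with "-" marking gaps); the function names and example sequences are hypothetical.

```python
IDENTITY_THRESHOLDS = (75, 85, 90, 95, 97, 99)  # values recited in the text


def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percentage of alignment columns holding the same residue in both sequences.

    Assumption: the denominator is the number of alignment columns, excluding
    columns where both sequences have a gap.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (x, y) for x, y in zip(aligned_a, aligned_b)
        if not (x == gap and y == gap)
    ]
    if not columns:
        raise ValueError("no comparable columns")
    matches = sum(1 for x, y in columns if x == y and x != gap)
    return 100.0 * matches / len(columns)


def highest_threshold_met(identity):
    """Return the highest recited threshold satisfied, or None if below 75%."""
    met = [t for t in IDENTITY_THRESHOLDS if identity >= t]
    return max(met) if met else None


if __name__ == "__main__":
    # Hypothetical aligned peptides differing at one of ten positions.
    identity = percent_identity("MKLSDAVLIT", "MKLTDAVLIT")
    print(identity, highest_threshold_met(identity))
```

Other conventions (for example, dividing by the length of the shorter sequence) give different figures for gapped alignments, so any comparison against the recited thresholds should state the convention used.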

Ideally said modified polypeptide has enhanced fatty acid elongase activity.

In a further preferred embodiment of the invention said polypeptide comprises the 
amino acid sequence represented in Figures 2a, 2b or 2c. Preferably said polypeptide 
consists of the amino acid sequence represented in Figures 2a, 2b or 2c. 


According to a further aspect of the invention there is provided a vector including at 
least one nucleic acid molecule wherein said nucleic acid molecule is selected from 
the group consisting of: 

i) a DNA molecule consisting of a DNA sequence as represented in
Figures 1a, 1b or 1c;

ii) a DNA molecule which hybridises to the sequences identified in (i)
above and which encodes a polypeptide which has fatty acid elongase
activity; and

iii) DNA molecules consisting of DNA sequences that are degenerate as a
result of the genetic code to the DNA sequence defined in (i) and (ii).

A vector including nucleic acid(s) according to the invention need not include a
promoter or other regulatory sequence, particularly if the vector is to be used to
introduce the nucleic acid into cells for recombination into the genome for stable
transfection.

Preferably the nucleic acid in the vector is operably linked to an appropriate promoter
or other regulatory elements for transcription in a host cell such as a prokaryotic
(e.g. bacterial) or eukaryotic (e.g. fungal, plant, mammalian or insect) cell. The
vector may be a bi-functional expression vector which functions in multiple hosts. In
the example of nucleic acids encoding polypeptides according to the invention this may
contain its native promoter or other regulatory elements and in the case of cDNA this
may be under the control of an appropriate promoter or other regulatory elements for
expression in the host cell.

By "promoter" is meant a nucleotide sequence upstream from the transcriptional
initiation site and which contains all the regulatory regions required for transcription.
Suitable promoters include constitutive, tissue-specific, inducible, developmental or
other promoters for expression in plant cells comprised in plants depending on
design. Such promoters include viral, fungal, bacterial, animal and plant-derived
promoters capable of functioning in plant cells.

Constitutive promoters include, for example, CaMV 35S promoter (Odell et al (1985)
Nature 313: 810-812); rice actin (McElroy et al (1990) Plant Cell 2: 163-171);
ubiquitin (Christian et al (1989) Plant Mol. Biol. 18: 675-689); pEMU (Last et al
(1991) Theor. Appl. Genet. 81: 581-588); MAS (Velten et al (1984) EMBO J. 3:
2723-2730); ALS promoter (U.S. Application Serial No. 08/409,297), and the like.
Other constitutive promoters include those in U.S. Patent Nos. 5,608,149; 5,608,144;
5,604,121; 5,569,597; 5,466,785; 5,399,680; 5,268,463; and 5,608,142.

Chemical-regulated promoters can be used to modulate the expression of a gene in a
plant through the application of an exogenous chemical regulator. Depending upon
the objective, the promoter may be a chemical-inducible promoter, where application
of the chemical induces gene expression, or a chemical-repressible promoter, where
application of the chemical represses gene expression. Chemical-inducible
promoters are known in the art and include, but are not limited to, the maize In2-2
promoter, which is activated by benzenesulfonamide herbicide safeners, the maize
GST promoter, which is activated by hydrophobic electrophilic compounds that are
used as pre-emergent herbicides, and the tobacco PR-1a promoter, which is activated
by salicylic acid. Other chemical-regulated promoters of interest include
steroid-responsive promoters (see, for example, the glucocorticoid-inducible promoter
in Schena et al (1991) Proc. Natl. Acad. Sci. USA 88: 10421-10425 and McNellis et al
(1998) Plant J. 14(2): 247-257) and tetracycline-inducible and
tetracycline-repressible promoters (see, for example, Gatz et al (1991) Mol. Gen.
Genet. 227: 229-237, and US Patent Nos. 5,814,618 and 5,789,156), herein
incorporated by reference.

Where enhanced expression in particular tissues is desired, tissue-specific promoters
can be utilised. Tissue-specific promoters include those described by Yamamoto et
al (1997) Plant J. 12(2): 255-265; Kawamata et al (1997) Plant Cell Physiol. 38(7):
792-803; Hansen et al (1997) Mol. Gen. Genet. 254(3): 337-343; Russell et al
(1997) Transgenic Res. 6(2): 157-168; Rinehart et al (1996) Plant Physiol. 112(3):
1331-1341; Van Camp et al (1996) Plant Physiol. 112(2): 525-535; Canevascini et al
(1996) Plant Physiol. 112(2): 513-524; Yamamoto et al (1994) Plant Cell Physiol.
35(5): 773-778; Lam (1994) Results Probl. Cell Differ. 20: 181-196; Orozco et al
(1993) Plant Mol. Biol. 23(6): 1129-1138; Matsuoka et al (1993) Proc. Natl. Acad.
Sci. USA 90(20): 9586-9590; and Guevara-Garcia et al (1993) Plant J. 4(3): 495-50.

In a preferred embodiment of the invention said tissue specific promoter is a
promoter which is active during the accumulation of oil in developing oil seeds, see
Broun et al (1998) Plant J. 13(2): 201-210.

"Operably linked" means joined as part of the same nucleic acid molecule, suitably 
positioned and oriented for transcription to be initiated from the promoter. DNA 
operably linked to a promoter is "under transcriptional initiation regulation" of the 
promoter.

In a preferred embodiment the promoter is an inducible promoter or a 
developmentally regulated promoter. 

Particular vectors are nucleic acid constructs which operate as plant vectors. Specific
procedures and vectors previously used with wide success upon plants are described
by Guerineau and Mullineaux (1993) (Plant transformation and expression vectors.
In: Plant Molecular Biology Labfax (Croy RRD ed) Oxford, BIOS Scientific
Publishers, pp 121-148). Suitable vectors may include plant viral-derived vectors (see
e.g. EP-A-194809).

Vectors may also include selectable genetic markers such as those that confer
selectable phenotypes such as resistance to herbicides (e.g. kanamycin, hygromycin,
phosphinothricin, chlorsulfuron, methotrexate, gentamycin, spectinomycin,
imidazolinones and glyphosate).

Alternatively, or in addition, said vectors are vectors suitable for mammalian cell
transfection or yeast cell transfection. In the latter example multi-copy vectors such
as 2μ episomal vectors are preferred. Alternatively yeast CEN vectors and
integrating vectors such as YIP vectors are suitable for transformation of yeast
species such as Saccharomyces cerevisiae and Pichia spp.

It will be apparent to one skilled in the art that a vector according to the invention 
may include nucleic acid molecules encoding different enzyme activities to facilitate 
the delivery of different enzyme activities to a transfected or transformed cell to 
reconstitute enzymic pathways.

In a preferred embodiment of the invention said cell is transformed with nucleic acid
molecules selected from the group consisting of nucleic acid sequences encoding
elongase and desaturase and/or acyl-CoA synthetase and/or diacylglycerol
acyltransferase activities to provide a cell in which at least part of an n-3 fatty acid
biosynthetic pathway is reconstituted.

In a further preferred embodiment of the invention said cell is transfected with
nucleic acid molecules selected from the group comprising nucleic acid sequences
selected from the group consisting of:

i) a DNA molecule consisting of the DNA sequence as represented in
Figures 1a, 1b or 1c;

ii) DNA molecules which hybridise to the sequences identified in (i)
above and which encode a polypeptide which has fatty acid elongase
activity; and

iii) DNA molecules comprising DNA sequences that are degenerate as a
result of the genetic code to the DNA sequence defined in (i) and (ii);

combined with at least one of the nucleic acid molecules selected from
the group consisting of:

iv) DNA molecules consisting of DNA sequences as represented in
Figures 3a, 4a, 5a or 6a;

v) DNA molecules which hybridise to the sequences identified in (iv)
above and which have desaturase, acyl-CoA synthetase or
diacylglycerol acyltransferase activity;

vi) DNA molecules comprising DNA sequences that are degenerate as a
result of the genetic code to the DNA sequence defined in (iv) and (v)
above.



In a preferred embodiment of the invention said cell is selected from the group
consisting of: mammalian cells (e.g. Chinese Hamster Ovary cells); yeast cells (e.g.
Saccharomyces spp, Pichia spp); algal cells (e.g. Phaeodactylum tricornutum,
Chlamydomonas reinhardtii); plant cells.



In a preferred embodiment of the invention said cell is a plant cell. 



According to a further aspect of the invention there is provided a plant comprising a 
cell according to the invention.

In a preferred embodiment of the invention said plant is selected from: corn (Zea
mays), canola (Brassica napus, Brassica rapa ssp.), flax (Linum usitatissimum),
alfalfa (Medicago sativa), rice (Oryza sativa), rye (Secale cereale), sorghum (Sorghum
bicolor, Sorghum vulgare), sunflower (Helianthus annuus), wheat (Triticum aestivum),
soybean (Glycine max), tobacco (Nicotiana tabacum), potato (Solanum tuberosum),
peanuts (Arachis hypogaea), cotton (Gossypium hirsutum), sweet potato (Ipomoea
batatas), cassava (Manihot esculenta), coffee (Coffea spp.), coconut (Cocos nucifera),
pineapple (Ananas comosus), citrus tree (Citrus spp.), cocoa (Theobroma cacao), tea
(Camellia sinensis), banana (Musa spp.), avocado (Persea americana), fig (Ficus
carica), guava (Psidium guajava), mango (Mangifera indica), olive (Olea europaea),
papaya (Carica papaya), cashew (Anacardium occidentale), macadamia (Macadamia
integrifolia), almond (Prunus amygdalus), sugar beets (Beta vulgaris), oats, barley,
vegetables and ornamentals.

Preferably, plants of the present invention are crop plants (for example, cereals and
pulses, maize, wheat, potatoes, tapioca, rice, sorghum, millet, cassava, barley, pea),
and other root, tuber or seed crops. Important seed crops are oil-seed rape, sugar
beet, maize, sunflower, soybean, sorghum, and flax (linseed). Horticultural plants to
which the present invention may be applied may include lettuce, endive, and
vegetable brassicas including cabbage, broccoli, and cauliflower. The present
invention may be applied in tobacco, cucurbits, carrot, strawberry, sunflower, tomato,
pepper.

Grain plants that provide seeds of interest include oil-seed plants and leguminous
plants. Seeds of interest include grain seeds, such as corn, wheat, barley, rice,
sorghum, rye, etc. Oil-seed plants include cotton, soybean, safflower, sunflower,
Brassica, maize, alfalfa, palm, coconut, etc. Leguminous plants include beans and
peas. Beans include guar, locust bean, fenugreek, soybean, garden beans, cowpea,
mungbean, lima bean, fava bean, lentils, chickpea, etc.

It will be apparent that transgenic plants adapted for the production of n-3 fatty
acids, in particular DHA, can either be eaten directly or used as a source for the
extraction of essential fatty acid, of which DHA would be a constituent.

According to a yet further aspect of the invention there is provided a seed 
comprising a cell according to the invention.
In a further preferred embodiment of the invention said cell is a yeast cell, preferably
of the genus Saccharomyces spp, preferably Brewer's yeast Saccharomyces
cerevisiae.

The genus Saccharomyces spp is used in both brewing of beer and wine making and
also as an agent in baking, particularly bread. Yeast is a major constituent of
vegetable extracts of which Marmite™ is a typical example. Yeast is also used as an
additive in animal feed. It will be apparent that genetically engineered yeast strains
can be provided which are adapted to synthesise n-3 fatty acids. These yeast strains
can then be used in food stuffs and in wine and beer making to provide products
which have enhanced n-3 fatty acid content and in particular DHA content.

According to a further aspect of the invention there is provided a foodstuff product 
comprising a yeast cell according to the invention. 


In a preferred embodiment of the invention said foodstuff product is selected from 
the group consisting of: wine; beer; bread, baking products (e.g. bread, cake); 
vegetable extracts. 

In a further preferred embodiment of the invention said wine or beer is
non-alcoholic.

According to a further aspect of the invention there is provided a fermentation 
process comprising a yeast cell according to the invention. 


In a preferred embodiment of the invention said fermentation process comprises the
steps of:

i) providing a vessel containing a yeast cell according to the invention and
constituents required for fermentation and fatty acid biosynthesis; and

ii) providing conditions conducive to the fermentation of the liquid composition
contained in said vessel.

According to a yet further aspect of the invention there is provided an animal feed 
product comprising a cell according to the invention.

In a preferred embodiment of the invention said cell is a plant cell or yeast cell. 

According to a further aspect of the invention there is provided a method of
modulating the level of n-3 fatty acid, in particular DHA, or variants thereof, in a
plant cell comprising:

i) providing a plant cell according to the invention;

ii) regenerating the plant cell into a plant; and

iii) monitoring n-3 fatty acid production by said plant.

According to a further aspect of the invention there is provided a method for the 
production and optionally the extraction of n-3 fatty acids, in particular DHA, 
comprising: 


i) providing a cell according to the invention; 

ii) providing conditions conducive to the growth of said cell; and 

iii) extracting n-3 fatty acids, or variants thereof, from said cell. 

According to a yet further aspect of the invention there is provided a method for the
production and optionally the extraction of n-3 fatty acid, particularly DHA,
comprising:

i) providing a plant cell according to the invention;

ii) regenerating said cell into a plant; and

iii) extracting n-3 fatty acids, or variants thereof, from said plant.
According to a further aspect of the invention there is provided n-3 fatty acids,
particularly DHA, or variants thereof obtainable by the method(s) according to the
invention.

In a preferred embodiment of the invention said n-3 fatty acid, or variant thereof, is 
for use as a pharmaceutical. 

In a further preferred embodiment of the invention said n-3 fatty acid, or variant
thereof, is for use in the manufacture of a medicament for use in the treatment of
conditions which would benefit from administration of n-3 fatty acids, or variant
thereof.

In a preferred embodiment of the invention said condition is selected from the group
consisting of: cardiac arrhythmias; rheumatoid arthritis; Crohn's disease;
schizophrenia; cancer; foetal alcohol syndrome; attention deficit hyperactivity
disorder; cystic fibrosis; phenylketonuria; unipolar depression; aggressive hostility;
adrenoleukodystrophy; coronary heart disease; hypertension; type II diabetes; ocular
diseases.

According to a further aspect of the invention there is provided a non-human 
transgenic animal comprising at least one nucleic acid molecule according to the 
invention. 

According to a yet further aspect of the invention there is provided a reaction vessel
comprising at least one polypeptide according to the invention, fatty acid substrates
and co-factors characterised in that said vessel is adapted for the conversion of said
fatty acid substrates to n-3 fatty acids, in particular docosahexaenoic acid.

In a preferred embodiment of the invention said vessel comprises polypeptides
having elongase, desaturase, acyl-CoA synthetase and diacylglycerol acyltransferase
activities to provide a vessel in which at least part of an n-3 fatty acid biosynthetic
pathway is reconstituted.

In a further preferred embodiment of the invention said polypeptides are those
protein molecules disclosed herein, in particular protein molecules which comprise
the sequences as represented by Figures 2a, 2b, 2c, 3b, 4b, 5b or 6b.

In a preferred embodiment of the invention said at least one polypeptide is expressed 
by a cell according to the invention. 

In a preferred embodiment of the invention said polypeptide(s) is/are soluble.
Alternatively said polypeptide(s) is/are immobilised.

In a further preferred embodiment of the invention said vessel is a bioreactor.

It will be apparent to one skilled in the art that a polypeptide according to the
invention has utility with respect to the in vivo biosynthesis of n-3 fatty acids through
transformation or transfection of nucleic acids encoding said polypeptide(s) into
suitable host cells. Fatty acids can then either be extracted from said cells or foods
comprising said cells can be eaten. Cells expressing said polypeptide(s) can also be
incubated under suitable growth conditions to facilitate the synthesis of fatty acids.
Alternatively, said polypeptide(s) can either be purified from an algal cell culture or
manufactured recombinantly and used in a bioreactor to synthesise fatty acids in
vitro. It will also be apparent that the invention involves, inter alia, the reconstitution
of at least part of an algal n-3 fatty acid biosynthetic pathway which, either in a cell
or in vitro, provides for a source of n-3 fatty acids which is an alternative to either
the exploitation of algae in bioreactors or the consumption of fish.

An embodiment of the invention will now be described by example only and with
reference to the following figures:

Figure 1a represents the nucleic acid sequence of a nucleic acid molecule comprising
a fatty acid elongase TpELO2.1; Figure 1b the nucleic acid sequence of the fatty acid
elongase TpELO2.2; Figure 1c the nucleic acid sequence of the fatty acid elongase
TpELO2.3;

Figure 2a represents the amino acid sequence of TpELO2.1; Figure 2b represents the
amino acid sequence of TpELO2.2; and Figure 2c represents the amino acid
sequence of TpELO2.3;

Figure 3a represents the nucleic acid sequence of PlDES1; Figure 3b represents the
amino acid sequence of PlDES1;

Figure 4a represents the nucleic acid sequence of a nucleic acid molecule comprising
fatty acid desaturase, PlDES2; Figure 4b the amino acid sequence comprising
PlDES2;

Figure 5a represents the nucleic acid sequence of a nucleic acid molecule comprising
acyl-CoA synthetase, PlACS1; Figure 5b the amino acid sequence comprising
PlACS1;

Figure 6a the full length sequence of a nucleic acid molecule encoding PlDGAT2-1;
Figure 6b the full length amino acid sequence of PlDGAT2-1 polypeptide; and

Figure 7a is the nucleic acid sequence of PlELO1; Figure 7b is the amino acid sequence
of PlELO1; Figure 7c is the nucleic acid sequence of PlELO2; Figure 7d is the amino
acid sequence of PlELO2.



Cultivation of Pavlova lutheri

Pavlova lutheri (CCAP 931/1) was obtained from the Culture Collection of Algae
and Protozoa (Dunstaffnage Marine Lab., Oban, PA34 4AD, Scotland, U.K.).

The growth medium used in all experiments was enriched artificial seawater medium
(EASW), made up in 20 l batches as described by Harrison et al. (1980), and
modified by Thompson et al. (1991). The medium was further modified by
increasing the macronutrient concentrations of NaNO3 and Na2SiO3.9H2O to 1 mM,
and NaH2PO4 to 200 µM. The silicate was dissolved separately in deionized distilled
water and the pH adjusted to approximately 8.0 with 50% HCl before it was added to
the medium. This medium was buffered to pH 8.0 by adding 20 mM
N-[2-hydroxyethyl]piperazine-N'-[2-ethanesulfonic acid] (HEPES) and 20 mM NaOH.
The freshly prepared medium was filtered through a 0.22 µm Millipore™ GS
membrane filter into a 20 l sterile propylene reservoir. It was then dispensed in 0.5 l
volumes into 1 l conical glass flasks and sterilized by autoclaving (30 min, 120°C).
The batch cultures were grown at 15°C with 50 µE m-2 s-1 constant illumination, and
aeration provided by shaking the flasks at 150 rpm.

Cell density was monitored by counting cells with a haemacytometer. Since the
Pavlova lutheri cells are motile, they were first incubated in 20 mM sodium azide to
immobilise them before counting.
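
By way of illustration only, the conversion of haemacytometer counts into a cell density may be sketched as follows. The chamber geometry is not stated above; the sketch assumes a Neubauer-type chamber in which each large square overlies 1e-4 ml, and the function name, the dilution factor and the example counts are hypothetical.

```python
def cells_per_ml(counts, dilution_factor=1.0, square_volume_ml=1e-4):
    """Estimate cell density from haemacytometer counts.

    counts: number of cells counted in each large square.
    dilution_factor: fold dilution applied before loading (1.0 = undiluted).
    square_volume_ml: volume over one counted square; 1e-4 ml is an
        assumption (1 mm x 1 mm x 0.1 mm), not a value from the text.
    """
    if not counts:
        raise ValueError("at least one square must be counted")
    mean_count = sum(counts) / len(counts)
    return mean_count * dilution_factor / square_volume_ml


# Hypothetical counts from four squares of a two-fold diluted sample.
density = cells_per_ml([52, 47, 50, 51], dilution_factor=2.0)
print(f"{density:.2e} cells/ml")
```

Averaging over several squares reduces the sampling error of any single square; the dilution factor would account for any volume of azide solution added before counting.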

The nitrate concentration was determined periodically during the culture time by
measuring the change of the medium absorbance at 220 nm, according to the method
described by Collos et al. (1999).

Isolation of total and poly(A)+ RNA from P. lutheri

The algal culture was harvested by centrifugation at 4,500 rpm for 15 min. The cell
pellet was suspended in 1 volume of extraction buffer (25 mM Tris-HCl pH 8.0, 25
mM EDTA pH 8.0, 75 mM NaCl, 1 % SDS v/v, 7.8 % β-mercaptoethanol v/v, in
DEPC treated water), and one volume of 25:24:1 phenol:chloroform:isoamyl alcohol
(v/v) was added. After centrifugation at 13,000 rpm at 4 °C for 10 min, the aqueous
phase was transferred to a new tube and 1 volume of 24:1 chloroform:isoamyl
alcohol (v/v) added. After a second round of centrifugation, the upper phase was
transferred to a fresh tube and LiCl added to a final concentration of 2 M. This
solution was incubated for 1 hr at -20 °C, and then centrifuged at 13,000 rpm at 4 °C
for 15 min. The resulting pellet was re-suspended in DEPC treated water and the
RNA was precipitated by addition of 0.1 volume of 3 M sodium acetate, pH 5.5 and
2.5 volumes of absolute ethanol followed by incubation for 20 min at 4 °C. This
sample was then centrifuged at 13,000 rpm at 4 °C for 15 min and the resulting pellet
was washed with 70 % ethanol, dried and re-suspended in DEPC treated water.
Quantity and quality of the extract were estimated by measuring optical density at
260 and 280 nm (1 O.D. at 260 nm = 40 µg/ml RNA). An aliquot of the extract was also
visualised on a 1 % (w/v) agarose gel containing ethidium bromide.
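
The spectrophotometric estimate described above may be sketched as follows. This is a minimal illustration assuming the conventional factor of 40 µg/ml of RNA per absorbance unit at 260 nm; the function names, the dilution factor and the example readings are hypothetical and are not taken from the experiments described.

```python
def rna_concentration_ug_per_ml(a260, dilution_factor=1.0, ug_per_ml_per_od=40.0):
    """RNA concentration of the undiluted extract, in micrograms per ml.

    ug_per_ml_per_od is the conventional conversion for RNA and is treated
    here as an assumption.
    """
    if a260 < 0:
        raise ValueError("absorbance cannot be negative")
    return a260 * dilution_factor * ug_per_ml_per_od


def purity_ratio(a260, a280):
    """A260/A280 ratio, commonly used as a rough indicator of protein carry-over."""
    if a280 <= 0:
        raise ValueError("A280 must be positive")
    return a260 / a280


# Hypothetical readings for an extract diluted 100-fold before measurement.
concentration = rna_concentration_ug_per_ml(0.25, dilution_factor=100.0)
ratio = purity_ratio(0.25, 0.125)
print(concentration, ratio)
```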

For the cDNA library construction, poly(A)+ RNA was prepared with the Poly(A)
Quick® mRNA isolation kit (Stratagene) from cells harvested during the exponential
phase.

cDNA library construction and pBluescript phagemid excision

Double-stranded, end-adapted cDNA synthesised using a cDNA synthesis kit
(Stratagene) was passed through a Sepharose CL-2B gel filtration column
(Stratagene) to remove adapters and small cDNA molecules. cDNA eluted from the
column was phenol-extracted, ethanol-precipitated and ligated to arms of the
Uni-ZAP XR Vector (Stratagene) before packaging into λ phage using the Gigapack III
Gold Packaging Extract (Stratagene). A primary library of 3 X 10* plaque forming
units (pfu) was obtained with the majority of the inserts examined ranging from 0.3
to 1.5 kb. The library was subsequently amplified.

After amplification, the cDNA library was excised using the ExAssist™ Interference-
Resistant Helper Phage (Stratagene). The excised phagemids were plated as
individual bacterial colonies following the manufacturer's instructions. The presence
of insert was checked by PCR using universal primers and clones containing cDNA
longer than 0.2 kb were selected for sequencing.

Sequencing and sequence analysis

Sequencing reactions were prepared with the ABI Prism BigDye Terminator cycle
sequencing kit (PE Applied Biosystems), using the universal T3 primer, and these
were run on an ABI 3700 (96-capillary) sequencer (PE Applied Biosystems). The
resulting Expressed Sequence Tags (ESTs) were processed such that all vector
sequences were removed and further examined using standard nucleotide sequence
alignment algorithms in order to identify EST clones with overlapping sequences.
These overlapping sequences were then assembled into contigs. These contigs were
annotated by comparison with the non-redundant peptide and nucleotide databases
available from the National Centre for Biotechnology Information (NCBI). The
NCBI databases had been downloaded to a local Silicon Graphics Server which
allowed the annotation of thousands of ESTs to be performed as a batch job using the
BLAST 2 sequence alignment algorithm (Altschul et al., 1997).
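
The principle of joining overlapping ESTs into a contig may be illustrated by the following simplified sketch. It is not the alignment software used in the work described; it assumes error-free reads and exact suffix-to-prefix matches, and the function names, the minimum overlap and the example sequences are hypothetical.

```python
def longest_overlap(left, right, min_overlap=20):
    """Length of the longest suffix of `left` that equals a prefix of `right`.

    Returns 0 when no exact overlap of at least `min_overlap` bases exists.
    """
    if min_overlap < 1:
        raise ValueError("min_overlap must be at least 1")
    max_len = min(len(left), len(right))
    for size in range(max_len, min_overlap - 1, -1):
        if left[-size:] == right[:size]:
            return size
    return 0


def merge_pair(left, right, min_overlap=20):
    """Join two reads into one contig, or return None if they do not overlap."""
    size = longest_overlap(left, right, min_overlap)
    if size == 0:
        return None
    return left + right[size:]


# Two short hypothetical reads sharing the six bases TTGACC.
contig = merge_pair("ACGTACGTTTGACC", "TTGACCGGATA", min_overlap=5)
print(contig)
```

Real EST data contain sequencing errors and reads in both orientations, so practical assemblers rely on scored alignments rather than the exact matching shown here.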

The PlELO1, PlELO2, PlDES1, PlDES2, PlACS1 and PlDGAT2-1 clones were
identified on the basis of homology with other fatty acid elongase, desaturase,
acyl-CoA synthetase or diacylglycerol acyltransferase genes in the NCBI nucleotide
and protein databases.

Hybridisation conditions

Stringent hybridisation/washing conditions are well known in the art. For example,
stringent conditions are those under which nucleic acid hybrids are stable after
washing in 0.1x SSC, 0.1% SDS at 60°C. It is well known in the art that optimal
hybridisation conditions can be calculated if the sequence of the nucleic acid is
known. Typically, hybridisation conditions use 4-6x SSPE (20x SSPE contains 175.3g
NaCl, 88.2g NaH2PO4.H2O and 7.4g EDTA dissolved to 1 litre and the pH adjusted
to 7.4); 5-10x Denhardts solution (50x Denhardts solution contains 5g Ficoll (Type
400, Pharmacia), 5g polyvinylpyrrolidone and 5g bovine serum albumin);
100 µg-1.0 mg/ml sonicated salmon/herring DNA; 0.1-1.0% sodium dodecyl sulphate;
optionally 40-60% deionised formamide. Hybridisation temperature will vary
depending on the GC content of the nucleic acid target sequence but will typically
be between 42°C and 65°C.
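
As an illustration of how working concentrations within the ranges given above could be made up from the stated stock solutions, the following sketch applies the usual dilution relation (stock concentration x stock volume = final concentration x final volume). The 50 ml final volume, the chosen working strengths and the 10% SDS stock are assumptions made for the example only.

```python
def stock_volume_ml(stock_strength, final_strength, final_volume_ml):
    """Volume of stock needed so that the final mix has `final_strength`.

    Strengths may be in any unit (for example "x" or "%") provided the stock
    and the final strength use the same unit.
    """
    if stock_strength <= 0:
        raise ValueError("stock strength must be positive")
    if final_strength > stock_strength:
        raise ValueError("cannot dilute to a strength above that of the stock")
    return final_strength * final_volume_ml / stock_strength


FINAL_VOLUME_ML = 50.0  # hypothetical batch size

recipe = {
    "20x SSPE": stock_volume_ml(20.0, 5.0, FINAL_VOLUME_ML),          # 5x SSPE
    "50x Denhardts": stock_volume_ml(50.0, 5.0, FINAL_VOLUME_ML),     # 5x Denhardts
    "10% SDS (assumed stock)": stock_volume_ml(10.0, 0.5, FINAL_VOLUME_ML),  # 0.5% SDS
}
recipe["water"] = FINAL_VOLUME_ML - sum(recipe.values())

for component, volume in recipe.items():
    print(f"{component}: {volume:.1f} ml")
```

The carrier DNA and the optional formamide are omitted from the sketch; each would reduce the water volume accordingly.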

Functional analysis by heterologous expression

Functional characterisation of the amino acid sequence encoded by TpELO2.1, 2.2
and 2.3 will be performed under protocols previously described (Jaworski et al.,
2001, US Patent No. 6,307,128; Qi et al., 2002). To this aim, several
species of fatty acid substrates will be considered: saturated (16:0, 18:0, 20:0, 22:0),
monounsaturated (16:1, 18:1, 20:1) and polyunsaturated (20:4n-6, 20:5n-3, 22:5n-3).

Functional characterisation of the amino acid sequence encoded by PlDES1 and
PlDES2 will be performed under protocols previously described (Qiu et al., 2001).
To this aim, several species of fatty acid substrates will be considered: saturated
(16:0, 18:0), monounsaturated (16:1, 18:1) and polyunsaturated (18:2n-6, 18:3n-3,
18:3n-6, 18:4n-3, 20:2n-6, 20:3n-3, 20:3n-6, 20:4n-3, 20:4n-6, 20:5n-3, 22:4n-6,
22:5n-3 and 22:5n-6).
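
The shorthand used for the substrates above gives the number of carbon atoms, the number of double bonds and, for polyunsaturated species, the position of the first double bond counted from the methyl end (for example 20:5n-3). A minimal sketch of reading this notation programmatically is given below; the class and function names are hypothetical and serve only to illustrate the notation.

```python
import re
from typing import NamedTuple, Optional


class FattyAcid(NamedTuple):
    carbons: int           # chain length
    double_bonds: int      # number of double bonds
    omega: Optional[int]   # n-x family (3 for n-3), None when not given


_SHORTHAND = re.compile(r"^(\d+):(\d+)(?:n-(\d+))?$")


def parse_fatty_acid(shorthand):
    """Parse shorthand such as '16:0' or '20:5n-3' into a FattyAcid tuple."""
    match = _SHORTHAND.match(shorthand.strip())
    if match is None:
        raise ValueError(f"not a recognised fatty acid shorthand: {shorthand!r}")
    carbons, bonds, omega = match.groups()
    return FattyAcid(int(carbons), int(bonds), int(omega) if omega else None)


substrates = ["16:0", "18:1", "18:3n-3", "20:5n-3", "22:5n-6"]
n3_substrates = [s for s in substrates if parse_fatty_acid(s).omega == 3]
print(n3_substrates)
```

A strict pattern of this kind also flags malformed entries in a substrate list, since any string that does not follow the carbons:bonds form raises an error.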

Functional characterisation of the amino acid sequence encoded by PlACS1 will be
performed under protocols previously described (Kang et al., 1997). To this aim,
several species of fatty acid substrates will be considered: saturated (8:0, 10:0, 12:0,
16:0, 18:0, 20:0, 22:0), monounsaturated (14:1, 16:1, 18:1) and polyunsaturated
(18:2n-6, 18:3n-3, 18:3n-6, 20:4n-6, 20:5n-3 and 22:6n-3).

Functional characterisation of the amino acid sequence encoded by PlDGAT2-1 will
be performed under protocols previously described (Lardizabal et al., 2001; Cases et
al., 2001; Zou et al., 1999). To this aim, DGAT activity will be assayed by
incorporation of [1-14C]diacylglycerol into TAG in the presence of several species of
fatty acyl-CoA substrates that are representative of fatty acids that partition to TAG in
P. lutheri. These include: 14:0, 16:0, 16:1, 18:0, 18:1, 18:2, 18:4, 20:5 and 22:6.

TAG extraction and fatty acid analysis

The algal cells (2 ml of culture medium) were harvested during the experimental
period by centrifugation at 13,000 rpm for 15 min. Fifty µg of tripentadecanoin
(15:0-TAG) were added to the pellet as an internal standard. The pellet was then
suspended in 1 ml of 2:1 chloroform:methanol (v/v) and frozen in liquid nitrogen.
After 1 hour at 4°C, the cell debris was discarded by centrifugation and 0.3 ml of
0.9% KCl added to the supernatant. After centrifugation, the bottom phase was
transferred into a 2 ml Eppendorf tube and the KCl phase rinsed with 0.5 ml of
chloroform. The chloroform phases were pooled and dried. The FA extract was
suspended in 0.2 ml of hexane, and this volume was divided into 2 fractions of
0.1 ml. The first fraction was dried, and the lipid extract suspended in 0.2 ml of
hexane. This represented the total lipid extract. The second fraction was used to
isolate the TAGs by hydrophobic chromatography. Bond Elut (Varian) 1 ml solid
phase extraction columns with 100 mg Si packing were used to partition TAGs from
other lipids in algal extracts. This protocol was adapted from a method described by
Yongmanitchai and Ward (1992). The eluate was dried and the TAG extract
suspended in 0.2 ml of hexane. The products of these two extractions were analysed
by GC as described previously by Larson and Graham (2001).
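
The role of the 15:0 internal standard may be illustrated by the following sketch, in which each fatty acid is quantified from its GC peak area relative to that of the standard. The sketch assumes an equal detector response for all species and takes the 50 µg of added standard as the reference mass; the function name and the peak areas are hypothetical and are not measured values.

```python
def quantify_by_internal_standard(peak_areas, standard_name="15:0",
                                  standard_mass_ug=50.0):
    """Convert GC peak areas to masses (micrograms) using an internal standard.

    peak_areas: mapping of fatty acid shorthand to integrated peak area.
    Assumes the same response factor for every species, which is a
    simplification made for this example.
    """
    standard_area = peak_areas[standard_name]
    if standard_area <= 0:
        raise ValueError("internal standard peak area must be positive")
    return {
        name: area / standard_area * standard_mass_ug
        for name, area in peak_areas.items()
        if name != standard_name
    }


# Hypothetical integrated peak areas from one chromatogram.
areas = {"15:0": 1000.0, "16:0": 500.0, "22:6n-3": 250.0}
masses = quantify_by_internal_standard(areas)
print(masses)
```

Because the standard is added before extraction, losses during the subsequent steps affect the standard and the analytes alike, which is the reason for adding it to the pellet rather than to the final extract.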

The same methodology will be employed to extract lipids and fatty acids from yeast
cells in order to perform the functional analysis of TpELO2.1, 2.2 and 2.3 and
PlDES1 following the feeding of different fatty acids as outlined above.

Cloning and characterization of the genes PlELO1, PlELO2, PlDES1, PlDES2,
PlACS1, PlDGAT2-1, TpELO2.1, 2.2 and 2.3

The first pass sequencing of 5,719 cDNA clones from a cDNA library prepared from
P. lutheri resulted in the identification of 34 cDNA clones from a single gene which
gives a predicted amino acid sequence that has significant identity with fatty acid
elongase genes from a variety of organisms. This abundance of copies of the
elongase gene indicates that it is expressed at a significant level in P. lutheri cells
that are producing DHA.

The sequencing of 5,719 cDNA clones from the P. lutheri library also resulted in the
identification of four cDNA clones from a single gene which gives a predicted amino
acid sequence that has significant identity with fatty acid desaturase genes from a
variety of organisms (Figure 3a and 3b). This desaturase gene has been designated
PlDES1.

The sequencing of 5,719 cDNA clones from the P. lutheri library also resulted in the
identification of three cDNA clones from a single gene which gives a predicted
amino acid sequence that has significant identity with fatty acid desaturase genes
from a variety of organisms (Figure 4a and 4b). This desaturase gene has been
designated PlDES2.

The sequencing of 5,719 cDNA clones from the P. lutheri library also resulted in the
identification of twelve cDNA clones from a single gene which gives a predicted
amino acid sequence that has significant identity with acyl-CoA synthetase genes
from a variety of organisms (Figure 5a and 5b). This acyl-CoA synthetase gene has
been designated PlACS1.






WO 2005/007845 



PCT/GB20O4/00JO57 



The sequencing of 5,719 cDNA clones from the P. lutheri library also resulted in the
identification of one cDNA clone which gives a predicted amino acid sequence that
has significant identity with diacylglycerol acyltransferase 2 genes from several
organisms (Figure 6a and 6b). This diacylglycerol acyltransferase 2 gene has been
designated PlDGAT2-1.
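
The clone counts reported above can be expressed as a proportion of the 5,719 clones sequenced, which gives a rough indication of relative transcript abundance in the library. The following sketch performs that arithmetic; the dictionary keys are descriptive labels chosen for the example, and EST frequency is only an approximate proxy for expression level.

```python
TOTAL_CLONES = 5719

# Clone counts as reported in the text for each gene.
clone_counts = {
    "elongase": 34,
    "desaturase 1": 4,
    "desaturase 2": 3,
    "acyl-CoA synthetase": 12,
    "diacylglycerol acyltransferase 2": 1,
}


def percent_of_library(count, total=TOTAL_CLONES):
    """Share of sequenced clones attributed to one gene, as a percentage."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * count / total


frequencies = {gene: percent_of_library(n) for gene, n in clone_counts.items()}
for gene, pct in sorted(frequencies.items(), key=lambda item: -item[1]):
    print(f"{gene}: {pct:.3f}% of clones")
```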

The full length cDNA and protein sequences of PlELO1 and PlELO2 are disclosed in
Figures 7a, 7b, 7c and 7d respectively.

The TpELO2.1, 2.2 and 2.3 genes were identified by screening a Thalassiosira spp
genomic database, which can be found at http://www.jgi.doe.gov/, with PlELO1 and
PlELO2.
Figure 1a

NCNCGCAGCAGQGTGCGCX3 CAAATTTTG

GTGCGGCGACTAAGTGCGGATGACAATAAAGATTAACGGGATTTATCAA^ 
GGGQAAAACACGGAAATATGTTCCCGTAGAGAAATAATGGTCTCGTTAAA 
TTAaTTATAATCTTAAACAATTTAGTGGTCAATATAATAGACAAAAATGA 
CAAATTAGTGGACTCTTTCX3CCACCAACrCTTCA 

TGTGCCCTCCTCCCTCTCAGTTGCfTACGATTCGCrGACCTCCTTTCTCT 
CTACCGTCGGCTCCAACACCATCATCATGTCGCAATTCCTAACCAG CATC 
CCCAAGGAATGCGTAGG CACCAACGGCCTCGGAGTCCACT ACGC CGAATT 
CTCCTGCCTCCACCCTCTCCT^ 

TCTACGATCCCGTCGCCACCCTCACCTGGATGCAAGATCGTCCCATGATC 

CCCATCATCGCCTGCGTCGCCTACGTCGTGCTCATCGTCCTGGGACGCGC 

CTACATGAAGGACCGGCCGGCGTGGAGCTGGAGGAGGATTTTGGCCGTTT 

GGAATTTGAGCCTGTCGCTCTTCTCGTGGATTGGCGCGATCAGGACGGCT 

CCTCAGTTGTATTACAACTTGACGACGTATTCGTTGAGGGATAATTTGTG 

CGATGATCCGGCGGCGTTGTATGGGAGTGGATCGACGGGACTTTGGGTGC 

AGTTGTTCATTTTGAGCAAGTTTCCCGAGTTGCTTGACAC 

GTC^TTGACAAGAAGCCX3CTCATCTTCCTCCATTO 

CGTCCTTCTTTACTGCTGGCATTCCTATGTGACCACTTCTCCCAGTGGTC 

TCTTCTTCGTCGTCATGAACTACAGTGTCCACGCGGTCATGTATGGGTAC 

TACTTCCTCATGGCGGTCAAATTCCGTCCCAAATGGTTCAACCCCATGTT 

CGTGACGTTCATGCAACTTTCTGAAATGTTTATT^ 

TTGTGGCATTTTATTATTAC^GTAATCCGATTTTGGGAAAaA 

ATCAGGAAGGAGAACAATGTTGCGGCCTTTGTCATGTACGGGAGCTACTT 

TTACTTGTTTGCACAATTCTTTGTGGCGAGGTATO 

AGGGGGATGCGAAGAAGAAGAAGGTTGTGTAAAGTGAGAGATGGAATGAA 
ACAACCATCTTGTTTGGGGAAGGGGGTATTGGATAGCGGGTACCATTCAG 
TATCGTTGAGGTGCATTTAATGTTGAATGAACAAACTTGACGAGACQAGG 
GATTTTGATCTTCATGAACGAGTGGGAGCATCTTTCAATCCATTGGGGAG 
AGAGGAGAAGTGAGAGAAGTGCTACTTTGGGAGTTTGAGAGAGTAAATTA 
ATGTCTTTTGCTATGAATTGCTGCCTCAAAAACGCAACGTGCTAGCAAAC 
CTCGTTAACAATGACAAAGTTATTTCTTGTTGTATGGGACATACCACGAT 
TGTATCATAAAAGAAAACCAATTCTATTGAGTTGTAAACATCTAGAGTGC 
AGTATCGAGCAACAGCCCACGCCATCACGATACACTAAACACACATTCGT 
CTTCATCTTTACATTCTAACCACAGCATGCTGGCTCTCTTACCTCTTCAN 
NC 
ACGCGGGGTTGCTACGATTCGCTGACCTCCTTTCTCTACTACCGTC

CATCATGTCG CAATTCCTAACCAGCATCCCCAA6GAATG06TAGGCACCAACQGCCTCGQ 

AGTCCACTACGCCGAATTCTCCTGCCTCCACCCTCrPCCTOGGC^ 

GGAACXSCTTCTACXtf^TCCCGTCGCCACCXITCACCTGGlATO 

CATCATCG CCTG CGTC^CCTA(^TCX3TGCTC^TCX3TCCrGGGACG CGCCT ACATGAAGGA 

ccx3 gccgg cx3tggagctggaggaggattttggccgtttgg cctgtcgctctt 
ctcgtggattggcgcgatcaggacx3gctcctcagtto 

gttgagggataatttgtg cgatgatccgg cgg cgttgt atgggagtggatcgacgggact 
ttgggtgcagttgttcattttgagcaagtttccra 
cattcacaaigaagccgcrcatcrtcctccattggtat^ 
ctgctggcattcctatgtgaccacttc^ 

cagtgtccacgcggtcatgtatgggtactacttcctcatggcggtcaaattcot 

atgcpitcaaccccatgttcgtgacgttcatgcaactttct^ 

agttaccattgtggcattttattattacagtaatccgat^^ 

caggaagg agaacaatgttg cggcctttgt catgt acgggag ctactttt acttgttcg c 

acaattctttgtggcgaggtattataagcttaaggtcaagggggatgcgaagaagaagaa 

ggttgtgtaaagtgagagatggaatgaaacaac catcttgtttggggaagggggtattgg 

ATAG CGGGTAC C^TTCAGTATCGTTGAGGTGCATTT AATGTTGAATGAACAAACTTGATO 
AGACGAGGGATTTTGATCTTCATGAAOGAGTGGGAGCATCTTTCAA 
AGGAGAAGTGAGAGAAGTG CT ACTTTGGGAGTTTGAGAGAGTAAATTAACGTCTTTTG C 
AAAAAAAAAAAAAAAAAAAAAAAAA 
Figure 2a



MSQFLTS I PKBCVQTNQLQVHYABF 8CLHPLLQATYLPFERFYDPA7ATLTWMQDRPM I PI 
IACTAYVVIjIVWRAYMKDRPAWSWRRIIjAV^^ 

RDOTjCTDPAALYGSGSTGLWVQLFILSKFPELLDTFFIVIHKKPLIFLHWYHHIT^ 

WHSYVTTSPSaLFFVVMNYSVHAVMYGYYFIiMAVKFRPKWFNP 

TIVAFYYYSNPILGKTOTIRKBNNVAAFVMYGSYFYLFAQFFV^ 

V 



TpELO2.1, amino acid sequence from cDNA

MSQFLTS IPKECVGTNGLGVHYAEFS CLHPLLGATYLPFERFYDPVATLTWMQDRPMI PI 

IACVAYVVLIVLGRAYMKDRPAWSWRRIIiAVW^ 

RDNLCDDPAALYGSGSTGLWVQLFILSKFPELLDTFFIVIHK 

WHSYVTTSPSGIiFFVVMNYSVHAVMYGYYFLMAVKFRPKWFNPMF 

TIVAFYYYSNPIIiGKTCHIRKENNVAAFVMYGSYFYLFAQFFVARYYKVKVKG 

V 
Figure 1b



CTCCTTCTCGGCGGTGTGTGCGGTCTTCGTTTTC 

CATCAGGTTTCCTAQACGTQCGQGGCCQCOTCCTTCTCTTGQQTTQQQCT 

TQCCCGCTTTGGTTTGATATCACAACAGTTACCTGGCAACCATGGA 

TAGAACGCTGCAATGGATAAGATCGGTGCCraCCATCA^ 

TCCCGATGGCAAGTTCCGTGCCGATAGAGAGGTGAGCATGAATGTACACA 

CCATGGTTGTCTCGGCATGACGGTGTCATTGGATGGTGTCAGTGCATCTC 

TCTGTTTGCATCTATTCTAAACAACACATCTCTTCACCTCGTTACCTrAC 

TCAACAACTACC^CACAACCATCATCATCXSTAGGACTGGTGGCTCTGCGA 

CTTCCGTAGCGCCATCACCATCGCTCTCATCTACATCGCCTTCGTCATCC 

TCGGTTCCGCCGTC^TGC^TCCCrCCCCGCAATGGATCCCTACCCCATC 

AAATTCCTCTACAACGTCTCCCAAATCTTCCTTTGTGCCTACATGACTGT 

CGAGGCGGGATTTTTGGCCTACCGCAATGGATATACCGTCATGCCTTGCA 

ATC^TTTCAATGTGAATGATCCTCCCGTGGCGAATCTTCTTTGGTTGTTT 

TATATTTCCAAGGTGTGGGACTTTTGGGATACCATl u rrCATTGTGTTGG^ 

GAAGAAGTGGCGTCAATTATCTTTCTTGCATGTATACCATCACACCACCA 

TCTTTCTATTCTATTGGCTGAATGCCAATGTCTTGTACGATGGTGACATC 

TTCCTTACCATCTTGCTCAATGGATTCATCCACACGGTGATGTACACGTA 

TTACTTCATCTGTATGCATACCAAAGATCCCAAGACGGGCAAGAGTCTTC 

CTATATGGTGGAAGTCGAGTTTGACGGCGTTTCAGTTGTTGCAATTCACT 

ATCATGATGAGTCAGGCTACCTACCTTGTCTTCCACGGGTGTGATAAGGT 

GTCGCTTCGTATCACGATTGTGTACTTTGTGTACATTTTGAGTTTGTTCT 

TCCTTTTTGCTCAGTTCTTTGTGCAAT CATACATGGCAC CCAAAAAGAAG 

AAGAGTGCTTAGATTGGAAAGGGGTGTGGGCGACGAGCTTTCCTGTTGAG 

GGTGGGTGGTGGAACGGAGTTGGTTTTTTGAAGCATCTGCAATATTCGCA 

GGACTGTTGCTGTGAGAATAGCTATGGAGTAAAGGTGGGGGGGGGTGGAT 

TCATGGCX3GACAGGCATGCCTAAGATACTAAGGAATGTTCATGAACATGA 

TGTTGATACTTTATTGTAAGGTACTGTTGGGAATTAATGAGAGGGTACTG 

AAAGGAGAGATG AGTGTCTGTCAAAACGCTTGGGTTAGTTGTTACTTTC C 

CTTCGTTCTTTCAGCTATAAGTCTTTGCTGAGGAGTTAATCCTAAGCTGA 

CACCATTACGTTGAACAACGCAACAATTAGCGTTGAGCCCGACAACTCTC 

GACAAAGAGGTTTTGTAGATTGTATCCCCTGGCGCAAGTTAACGTACAGG 

TCCTTCATTCACGGAACCATAATCCCATGGATGCATCCTGTGCCAATAAC 

CTTCAAAAACGTGCGTCCCACTTGAGAAAACCACTATTACGAGTTTCACC 

TCAGGTCCTGACCGGCAAAAACAATTGAATCAGCAGCAAAGCCCACAAGC 

AAGCACTTCGCGATGAGGACCACAGGAAGAGACGCTCACACCTCCCCGCC 

TTCGGACGAGCCCCACGAGCGCGNGTGG 



Figure 2b



DWWLCTPRSAITIALIYIAPVILGSAVMQSLPAMDPYPIKPLYNVSQIFLCAYMTVEAGF 
LAYRNGYTVMPCNHFlWinDPPVAl^IiWLFY I SKVTOPVTOTIFIVMKKWRQLSFLHVYHH 
TTI FIiFYWLNANVL YDGD I PLTI LLNGFIHTVMYTYYF ICMHTKDPKTGKSLP I WWKS SL 
TAFQLLQFTIMMSQATYLVFHGCDKVSLRITI 
SA 
Figure 1c



AAAAAAAAAAAAAAAAAAAAAAGANAGGAAA 

CCTGCAGGTCGACTCTAGAGGATCCCCGTGTTGTCAATGTGGCGCAAGTGCT 

GGGTGGACGGTGTATGCGATTGTGGATGCGGTQATGAATAGAGACCATC 

AGTAGAAGTTTGGTTGGGGCGGCGTTGCATAGTGGGAGCTCGTATGCGGTGTGGGTTCAT 

TATTGTGATAAGTATTTGGAGTTCTTTGATACGTATTCT 

GACCAGGTGAGTTGACGAGTTGCTX3TTTAGTGTTGGTTAGATGGTACT 

TGACAGTGTGTGGTOTGGCGTTGGATATATGGATATGGAGAAGGT^ 

GGAACAATGAGACACATCCTGCGCACAGTGTCCAGAGAGACGAATCTO 

GATCATTTAAGAGTTCATCAGCTACGCAGAAGAATGGTGTAAATGATACTO 

AAAAGTTGGCATGATACTACTCAGCTTTGAAGTGCATCGGTCTGCTCA 

GGCTTTCACCAACAACGTTACCAATCCAC^^ 

AAAAAAACAGGTCTCCTTCCTCCACATCTACCACCACACGACCATAGCGTGGGCATGGTG 
GATCGCCCTCCGCTTCTCCCCCGGCGGAGACATTTACTTCGGGGCACTCCTCAACTCCAT 
CATCCACGTCCTC^TGTATTCCTACTACGCCCTTGCCCTACTCAAGGTCAGTTGTCCATO 
GAAACGATACTTGACTCAAGCTCAATTATTGCAATO 

GTGTACGGGTTATACTCATTACTATCATACGAAGCATGGAGCGGATGAGACACAGCCTAG 
TTTAGGAACGTATTATTTCTGTTGTGGAGTGCAGGTGTTTGAGATGGTTAGTTTGTTTGT 
ACTCITTTCCATCTTTTATAAACGATC 

TAGCAAGAAGAATGATGATGGGAATAATGAGGATCAATGTCACAAGaCTATGAAGGATAT 

ATCGGAGGGTGCGAAGGAGGTTGTGGGGCATGCAGCGAAGGATGCTGGAAAGTTGGTGGC 

TACGGCGAGTAAGGCTGTAAAGAGGAAGGGAACTCGTGTTACTGGTGCCATGTAGATAAA 

GAGGTTGAAGAGAGATGAAGGCAACTCTTCATGATGGTGGTCGAAGTTO 

ACTGTATGAATGAAGATAAAGGTGGTTGGACAAGGATGTCTTGGAATACGGCA 

GAGAACAAGTTGCTAATGATTCTAGAGAATGTACATTCAGACTTCGTGTATAAAGACGAT 

ACTCCGGGATCGTCACGTACCGTTTCGAAGTAGGCCATGCTCAAGACCGTGATATACTGA 

GTGCGCCGATCTATCTACTTGAAGCCATCCTTACTGTGCGGCGATCGAACAAGAATTCCC 

GACNGG 



Figure 2c 



MWRKCCSNGWTVYAIVDAVMNRDHPFIGSRSLVGAA 

MVLRGKMDQVSFLHI YHHTTI AWAWWIALRFS PGGD I YFGALLNS I IHVLMYS YYALALLK 
VS C P WKR YLTQ AQLLQFT S VVVYTGCTG YTHYYHTKHGADETQP S LGT YYFCCG VQ VFEMV 
SLFVLFSIFYKRSYSKKNKSGGKDSKKNDDGNNEDQCHKAMKDISEG 

LVATAS kavkrkgtrvtgam 
Figure 3a



ACOOOOTGm^CXJCOCOTClTCaUJCOOTM 

cTACOAcaccAAoaccTTccaTOAaaAocAcccaoTcoaajc 

TOaAOTACCACCOOCOOACXnXWCCCAAOaCOCOaATaAOCAAOTTCr 

OACAOTGCCTACCTCCCXJCTaTOCGCQaAaOTK^ 

OflCQOOQQCgCTO OTOQTQQCCT 

TCtmrm\llWUaATCOOTCTCAAC^TC 

TAOGCaCAGOACTCKSATCaaCOaCAACATGOTXJCTCKNJCTO 

COACCCOQACCAQAA QGCQ CACQOCOTQCTOCQQCTCAAQCCAAOQQACQ^ 

TGCCCGG CQAGG CGATGTACG COTTTAAG CTGCTCTTCCTCGACGCGCTCGAGCTOCTCGCGTGGCGATGGGATO 
CCCCroaC<3CGCOCCCl\Jri^CACCAOCQaTO 

QAOCX3TQCACACQG CQCTQTQ CATCTG CO CQACQQTQTQ CACQQQ CTCCTTCTACCTCG CCT lHl"lUXrriHL"lU t CATCTCQC^CAACTTTQ 
ACQOCQTQGQTAGTQTQQQCX!CCAAGGq CAG CTTQCCQCQCTCTQCAACCTrCXi'IXJ CAP COG CAGQTCGAQACOAQTTCQAATQTQGG C 
QQCTACTCreiri\iOCflTqCTCAATOaAGG 
OATTOCCC?CAGTaOTOCGCACaCACATCC3AGAAGCTCGGCTT^ 

TAGCCAGCAACCXXaGTGCCMCaAGCCCCTCTTCC^^ 
GACGTCGCCGTGC^CTGGCGCTCTCCGTCGCCAGCCACTGAaA^ 
AAAAAAAAAAAAAAAAA 
Figure 3b

MFP8AASBGQ VABLRAAKVA 8YTRKAVDBR 
OGRDATBAPM BYHRRTWPKA RMSKPPVOSfc 
PAPPSYWLKA AALWAAV8I EOYMLLROKT 
8VZNYCLOYA QDMIOONMVL WLQBHWMHH 
ALQQLYILPQ BAMYAPKLLP LDALBLLAWR 
LPLWLQPTVH TALCICATVC TOSFYLAPPP 
TSSNVOOYWL OVLNOOLNFQ IKHHLPPRLH 
NLS8MLQHMO KMOTRPOABK OOKAB 



PPLTIVODAV YDAKAPRDBH PVQAHPV8LP 
DA8BKPTQAO 8AYLRLCABV NALLPK0800 
LLLSVPLOLV PAWIOLNIQH DANH0AL8RH 
LHTNDVDADP DQKAHOVLRL KPTDOWMPKH 
WBOBKISPLA RALPAPAVAC KLOPWARFVA 
PI8HNPD0V0 8VOPK08LPR 3ATPVQRQVB 
HSYYAQIAPV VRTHIBKLOP KYRHPPTVOS 
Figure 4a

GCACOAGGGTGCTGCTACCTGCTQTACGTCTCCCTCGGCTCGATGTACAT

CTTCTGCAACTTTGCCGTGTCGCACACGCACCTGCCCATCGTTOAGGCCG 

ACCAGCACGCCACCTGGGTTGAGTACTCGGCCAACCACACGACCAACTGC 

GCGCCCTCGTGGTGGTOCGACTGGTGGATGTCTTACCTCAACTACCAGAT 

CGAGCATCATCTGTTCCCGTCCATGCCGCAATTCCGCCACCCGACGATCG 

CGCCGCGC GTCA AGGCGCTCTTCOAGAAGCACGOGCTOCACTATGACGTG 

CGCGGCTACTTTGAGGCGATGGCCGACACGTTCATGAACCTTGACAAGGT 

CGGCAACGCGCACGAGCACAACCATTAGGCCGTAGCCGCTTGGAAAGAGG 

CCTCCTGCATACGCGGCGACGCGTCGGCGCGCGGCGGCGTGCACGGGAGC 

ACAAAGTGATGGATGGACCTTGGGCGACGCCGACGGCCAAGGAGTGGTTG 

TCTCTGTCGTCGCCAGGGCCCAGGAGCCCAGGGGCAGGGTTGCAGAGCTT 

GGGCGCGATTGGAGGCAGGGCCGGGCGCGTCGGCGTTCGCGAGTCTGGCG 

AGGCGCTCTGCGAGCTCTGCACGACTGCGCCCAGAGGCGTGCGCGCGCGC 

GCGAGTTCCAAAAAAAAAAAAAAAAAAAAAAA 



Figure 4b 

ARGCCYLLYVSLGSMYIFCNFAVSHTHLPIVEADQHAT 
WVEYSANHTTNCAPSWWCDWWMSYLNYQIEHHLFPS 
MPQFRHPTIAPRVKALFEKHGLHYDVRGYFEAMADTF 
MNLDKVGNAHEHNH 
Figure 5a

GCACGAGGCCTCITCGGCrGGGCGCTCaACGACQCGCTCGCCAAGTATGA 
CAAGGGCGGCGTCGGCCC(XjGCTTCCTGTACAACGCGGTCGTCTTCTCGT 
CGGTGCAGGCGCTGCTCGGCGGTCGCGTGCGCATGATGGTCGCCGGCTCC 

gcgcccctctccgccgacgtgcagaagtttgtgcaatcgtgcttcaacgc 

gccgcttcgccaaggctacggcctcaccgagacgtgcgcggcgacgacgc 

tctgcgcgctgcacgacaacacgccgtcgcaagttgggccgccgcaggag 

tcggcgtgcatcacgctgcgcgactgggaggagggcaactaccgcaaccg 

cgacgccaacgacccggccatcgggatgcggcgcggcgagatcctgatcg 

gtgggcccgccgtctgcctcggctactacgtgaacgagcgcgcgcccgac 

gcggacgtggtgaagcgcaacgcggaggactttgtgacgatcaacggcat 

gcgcttcttctgctcgggcgacatcggccagatcacgccgagcggctgcg 

tgcagattatcgaccggaagaaggacctcgtcaagctgcagcagggcgag 

tacgtcgcgctctccaaggtggagaacgcgctcaagaactcgtcgtacac 

gcagatcccgtacgtctacgcgctctcatccaagagctactgcatcgcgc 

tcctctgcccgcagcacgcggcgatccgccagctcgccgcctcgctgcag 

atcagcggcaaggagctttccgagctgtgcgcgcacccgcagatcgtcgc 

ggccgtgctcaaggacctgcaggcgcagtgcaaggcggccaagctcgcgg 

gcttcgagacgccgagcaagctcatcctcgtgtcggacgagtggaccgtt 

gagaatgacatgctcaccacgacgatgaagatcaagcgcaagccaatcgc 

TGACC GGCACGCGAGCGAGATCAAGGCCGTTTACGTCTGAGCCCGCGCCT 
TTTTGTACAACCTCGAGAGCGCCACTGTCTTGATGGCGCGCGCGTGCTGT 
TGTGCAGGCCGTCGGCATTGACCGCGGCGCTTGAACGCAAGGCAGGCGCA 
AGGCGCGGGAGGGATTGCTGGGGATGGCGGCTGCCGCAGTTGCTGAGCAG 
AAGGCAGTCTCCGGCTCTCGACAGGTGGCGCCCGTTGTGCAGAATGTTCG 
CAGCCCCTCCCCCCTCGGGCGGCTGCCATTCGGGGCAGCGCTCGCACATG 
TGCTGCGCTCCGCAGCCGCACGCCACGGCCACCAACGCGTGTGCCTGCCG 
TCACGCGCCGCGCCCGTGGGAACGACCGTTGCCCTCGCAC 



Figure 5b 

ARGLFGWALDDALAKYDKGGVGPGFLYNAVVFSSVQALLG 
GRVRMMVAGSAPLSADVQKFVQSCFNAPLRQGYGLTETCA 
ATTLC ALHDNTPSQ VGPPQES ACITLRDWEEGNYRNRD AND 
P AIGMRRGEILIGGP AVCLGY YVNER APD AD V VKRN AEDF V 
TINGMRFFCSGDIGQITPSGCVQIIDRKKDLVKLQQGEYVAL 
SKVENALKNSSYTQIPYVYALSSKSYCIALLCPQHAAIRQLA 
ASLQISGKELSELCAHPQIVAAVLKDLQAQCKAAKLAGFETP 
SKLILVSDEWTVENDMLTTTMKIKRKPIADRHASEIKAVYV 
Figure 6a



ACTQCQTOTACACAQCATQQCQGCTCGCGCQQTT 
GGCXnTCGTGCAQATCGGCGTGTGGGCGCTCACT 
GTTCCACTGGAAGGTGACX3CTGCCX3CTGCTXX3CCCOT 



QTTCACQTTCATGCOCAQQOTCTACCQOCAOCQCOTTCACOTOCCAGCTQGCCTCQAO^ 
CQAGGAGCAGATCATCCTAGCGCTGCATCCGCACGGCrCGA 
CCT03ACXK3CC^GCTGCTCX3ACCTACTaCCCGCGCTGCGCGGC^ 
GGCGAGCGTGCTCTTTCGGCTTCCCATTOTGOT 

CGACGCGCGCCGCTCX3GTTGCCGAGAGTGCGCTGCGTGGCGGCTACTCAGTCGGCGTACT 

GCCCGGCGGCGAGCAGGAGCAGCI^GCACGCGCTACGGGCGCGAGTOT 

CAAG03CTTTGGCTTCGTCAAGCTTGCX3CT 

CX5TGTTCGGGTGCGTCGACCTGTACCA<^CTTCATCCCTGCTCTTCT 

GCTCGTGCGCTCTCTCGGCGTGTGCGTGCCCGTGTGCTTCGGAGCGTGGGGCGTGCCCAT 

GGOX:CGCTTGCrGTGCCGCrCAACGTCGTGATCGGCCGGCCX^ 

CCCTGAGCCGACCGATGAGGACGTCGCGCGCGCGCTCGACCAGTACATCGCCGCGCTGCG 

CGCGCTCTTTGACGAGAACAAGGCGCGCTTTGGCTATGCCGACCGCGAGCTGGAGGTGTG 

CTGATTGTGAAGAAGTGTCATTGAAGGTCGGCGTCAGCAGGCGCACCGCGCACCAAGCCA 

CTCACGTCTTGATCGCTGAACCGCCGTGAATCATGC^ 

CAGAAAAAAAAAAAAAAAA 




iTTTTTGGCT 
Figure 6b

MAARAVDALV VSAFTAFVQI GVWALTPVQI AWA1AFHWKV TLPMALYLA SYLDGAEVRV 
KRVRAWPAFS RHFWLFTFMR RVYRQRVHVP AGLEAEEQII LALHPHGSMA DYRAILDGQIj 
LDLLPALRGK MRWI*AASVLP RLPIVRBLTL WTGCIDARRS VAESALRGGY SVGVLPGGEQ 
EQLRTRYGRE SVYLRKRFGF VKLALRFGVP LVPGYVFGCV DIiYHTSSLLF SAREWLVRSL 
GVCVPVCFGA WGVPMAPIAV PLNWIGRPI KLPRNPEPTD EDVARALDQY IAALRALFDE 
NKARFGYADR ELEVC 
Figure 7a

<K3CAOTAGK3<H3GAOATO^ 

CCGGAGCGTGCAAATGTCAAGATGTCCQAGGCGCTGCGCGT^ 
TCACAGCTCX3(^GATCCTOTCOGCCX3CGCTOCTCGTCACGaCOT 
TCX3CXK^CCTCGCCX3AG^TCTGGCGCTCGCTGC^ 
GCCXm3CTGCTTCTCGGCrACTACGCTCT 

GCAGCTGCGCX3ACX3ACAAGGACGACGGCAGCCTGAGTG<X3ACGAG 

ATTGCGGCAATTTTTGCGACGAGTCGGTCGACreCCAGATG 

OSCTGCTACTTCCX^CCTGGC^TCOT 

CGCGCGCAACK3AGTTCXJAGACTGTCGTCTTCAC 

(XSCGAGATATCGACATCCTCGTCGTCAACTGCTOTCT 

ATGAACGACTACCAGATGAAGGACTCCGTACAGAGCT 

CATCTCAATCCACCTCGCAAAGGACCTGCTK3CAGGT 

AGAACATCACGCAAAATTTTTACCAGGGCAACGAAAAGTCGATGCTCAT^ 

GGCGGCGCCGCCGTCCTCCTCTCCGGCCGCCACGCCGACCGGOT 

CX3TC(^CACGCACAAGGGCGCGGACCCGGACGCGTACC 

ACGTGGGCGTGCGCCTGTCGAAAGACX3TGATGGAGTGCX3CCGGCGCCGCGAT 

CTCGCGCCTCTGATTCTGCCCGTTTCTGAGCAGGTCCGATTTCTCGCAAACTAOT 

GCGAATGAAAGGCX3TGAAGGGATACGTGCCGGACITCACAACGGCCGTGCA 

GCGGGCGCGCGGTGCTCGACGCGCTGCAGG03AACTTGTCGCTCT 

TACTCCCTGTGGCGCTGGGGTAACGTCTCAAGCGCCTCAGTCKK3 

CGGCCGCATCCGGCGGGGCX^CAAGGTGTGGCAGATTGGGTTTGGCAGCGGCTTCAAGTG 
TCTGGCGGGCXH^CCGAGCGATGCCCTAGCTACGCCGGOT 

AGTCACACTGACGAGTGCGGAGTGACGTCTGACXKJCTTCCCCCCCCCGCCCACCACCT 

CITCACTCTCACTCAATCGCGCGGCGGCCAGAGCAGGAGCGCGCTCGTGCTCX3 

CCTCX3CGCCGCTOTAGCGAGCGCGCGTCCATGAGCGGCACGGACGCGAAGCGGA^ 

GCAGAAAAAAAAAAAAAAAAAAACTCGAGACTAGTTCTCTCT 

GTGTGCATGCTCGCCGTCACCGGCATGCTTGTCCAGGAGGTGTACTCGTO 

CAAGGCGCCGACGCCGCTCGGCGCGCTCTCGACCGTGCCGGCGCTCGGCC^ 

TCGGCATCATCGAGGTGCGCTCXK3CGAACTACCAGGGCC^ 

GG 
Figure 7b



MAAPTSPYGA ESPRAAYAYP BRANVKMSBA LRVLDEGVHP LVIHSSQILA AALLVTAAVN 
HFPKITVADL AEIWRSLiQID VAYAFALTAV AVLLLOYYAL RHPRPVYLVD PATWQLRDDK 
DDQSLSATSD PFRSTITDOQ NFCDE8VDPQ MKLFERNQIS ERCYFPPGIR AYRKGERDPD 
FSMAAARKEF ETWPTTVDE LLAKTOVKPR DIDILWNCS LFNPTPSLAA IVINHYQMKD 
SVQSYSLGGM GCSAOLISIH LAKDLLQVYP RKRALVISTE NITQNFYQGN EKSMLISNTL 
FRMGGAAVLiL SGRHADRRVA KYQLLHTVRT HKGADPDAYR CVFQEEDKAG HVGVRLSKDV 
MECAGAAMKT NI8VLAPLIL PVSEQVRFLA NYVARKWLRM KGVKGYVPDP TTAVQHFCIH 
TGGRAVLDAL QANLSLSDYY LEPSRYSLWR WGNVSSASVW YELDWLEKSG RIRRGDKVWQ 
IGFGSGFKCN SAVWRACRAM P 



Figure 7c

GCACOAGGCCTCGTGCCQAATTCGGCACOAGGOGQCGCTGTGGTCGTOGT 
TACCGACGTACGACGAGTITGTCGATGGGCTTTCGTTCGTCGACCX3CGAG 
AAGATOGGCGTGCACATGGTCGA(XAGGGCGTGATTACCTCTGCGGAGTG 
GGCGGCCATCTCGGTCGACAAGCACATGTCCTTCTTCTCCGACGCGGCCG 
AGTTCACGGGCGACCACTGGATCATCCCGCTCGTCGCGGTCGCACTCTAC 
CTCGTGATGATCGTCGTCGGCCCAATGATCATGGCCAACCGGCCGCCGCT 
CCCCGTGAATGGGCTCGCCTGCGCGTGGAACTGGTTCCTGGCCGCATTCA 
GCACTTTCGGCGTGGCTTGCACGTGGCACTGTATCTTCACCAGGCTGCGT 
AGCCGCGGCTTCGAGAGCACGACGTGCGGCAGCGCCATGTTCATGTCGCA 
GGGGTACGTTGGCTTGGCAATGCTGCTCITCATCTACTCCAAGCTCTTCG 
AGTTGATCGACACCTTCTTCCTCATCGCGAAGAAGGCGGATGTGATCTTC 
CTGCATTGGTACCACCACGTCACCGTGCTGCTCTACTGCTGGCACTCGCA 
CTCGGTCCGGATACCGAGCGGGATCTGGTTCGCCGCGATGAACTACTTTG 
TGCACGCCATCATGTACTCCTACTTTGCGATGACGCAGATGGGTCCGCGC 
TACCGCAAGCTCGTCCGGCCGTACGCGCGGCTGATTACGACCCTGCAGAT 
CTCGCAGATGTTCGTCGGCCTCATCGTCAACGGCTCGATCATTTACTTCA 
CGTCGCTCGGGCACGCATGCAAGTCGAGCAAGACGAACACGATCCTGAGC 
TGGCTGATGTACCTCAGCTACnTGTGCTATTCGGACTGCTCTACCTGCG 
CAATTACATCCTTGGTACACATGGCAAGCCGGCGGGCAAGCGCGCAAAGG 
GCAAGGCGGAAIAGTGCAGGGGCCGGGGAGGCGGTGCCCACCCGCGCTCG 
CAAAGCGGTCGCGCTCCTTGCCGAGATGCGACGAGAGTCGAAGAGGTGAA 
ACCTCCTIAAAATAATGCTACrCCTAGATTTTCGCTTTGTGCTTCCGTAI 
AGATGGTCAAGCC 



Figure 7d